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Abstract 

Planning is a natural domain of application 
for frameworks of reasoning about actions and 
change. In this paper we study how one such 
framework, the Language £, can form the ba- 
sis for planning under (possibly) incomplete in- 
formation. We define two types of plans: weak 
and safe plans, and propose a planner, called the 
£ -Planner, which is often able to extend an ini- 
tial weak plan into a safe plan even though the 
(explicit) information available is incomplete, e.g. 
for cases where the initial state is not completely 
known. The f-Planner is based upon a reformu- 
lation of the Language £ in argumentation terms 
and a natural proof theory resulting from the re- 
formulation. It uses an extension of this proof the- 
ory by means of abduction for the generation of 
plans and adopts argumentation-based techniques 
for extending weak plans into safe plans. We pro- 
vide representative examples illustrating the be- 
haviour of the ^-Planner, in particular for cases 
where the status of fluents is incompletely known. 



Introduction 

General formalisms of action and change can provide a 
natural framework for the problem of planning. They 
can offer a high level of expressivity and a basis for the 
development of general purpose planning algorithms. 

We study how one su c h formalism, the Langu age £ 
( |Kakas fc Miller 1997bj ; |Kakas fc Miller 1997a|) , can 
form a basis f or planning. To do this we exploit the 
refo 



mulation flKakas, Miller, k Toni 199S| ) of the-Ls 



gna.jrp £ into an argumentation frampwork and the a.s- 

sociatcd proof theory offered by this reformulation. A 
simple extension of this argumentation-based proof the- 
ory with abduction forms the basis of planning algo- 
rithms within the framework of the Language £. 

In this paper we will be particularly interested in ad- 
dressing the specific problem of planning under incom- 
plete information. This amounts to planning in cases 
where some information is missing, as for example when 
we do not have full knowledge of the initial state of the 
problem. In general, we assume that this missing in- 
formation cannot be "filled in" by additional actions in 



the plan as it may refer to properties that cannot be 
affected by any type of action in the theory or to an 
initial time before which no actions can be performed. 
Instead, the planner needs to be able to reason despite 
this incompleteness and construct plans where this luck 
of information does not matter for achieving the final 
goal. 

We define a planner, call the ^-Planner, which is able 
to solve this type of planning problems with incomplete 
information. It works by first generating a conditional 
plan based on one possible set of arguments in the cor- 
responding argumentation theory of the planning do- 
main. These plans are called weak plans and may not 
be successful under every possibility for the missing in- 
formation. The planner then uses further argumenta- 
tion reasoning to extend the weak plan to a safe plan 
which is able to achieve the planning goal irrespective 
of the particular status of the missing imformation. 

Planning und er incomplete information is a relatively 
new topic. In ( Finzi, Pirri, fc Reiter 19*9^) this prob- 
lem is called "Open World Planning" and is studied 
within the framework of the situation calculus. The 
incomplete information refers to the initial situation 
of the problem and a theorem prover is used to rea- 
son about properties at this situation. Other related 
work on planning within formal frameworks for reason- 
ing about actions and change is ( Levesque 1996 ), which 
defines a notion of conditional plans, (phanahan 1997| ; 



Denecker, Missiaen, & Bruynooghe 1992), with a for 
mulat ion of abductive planning in the even t calculus 
and (Dimopoulos, Nebel, & Kochler 1997; Lifschitz 
1999 ) , which study "answer set planning" within ex- 
tended logic programming. 

A Review of the Basic Language S 

The Language £ is really a collection of languages. The 
particular vocabulary of each language depends on the 
domain being represented, but always includes a set of 
fluent constants, a set of action constants, and a par- 
tially ordered set (II, ;<} of time-points. For this paper 
where we arc interested in linear planning we will as- 
sume that ^ is a total order. A fluent literal is either a 
fluent constant F or its negation ->F. 



Domain descriptions in the Language £ are collec- 
tions of statements of three kinds (where A is an ac- 
tion constant, T is a time-point, F is a fluent con- 
stant, L is a fluent literal and C is a set of flu- 
ent literals): t-propositions ("t" for "time-point"), of 
the form L holds-at T; h-propositions ("h" for "hap- 
pens"), of the form A happens-at T; c-propositions 
("c" for "causes"), of the form A initiates F when C 
or A terminates F when C. When C is empty, the 
c-propositions are written as U A initiates F" and 
11 A terminates F" , resp. 

The semantics of £ is based on simple defini- 
tions of interpretations, defining the truth value of t- 
propositions a t each particular time-p o int, and models 
Briefly, (see flKakas k Miller 1997a|; |Kakas & Miller 



199|bj) for more details) these are given as follows 

• An interpretation is a mapping H : $ x II i— > 
{true, false}, where <E> is the set of fluent constants 
and II is the set of time-points in £. Given a set of 
fluent literals C and a time-point T, an interpretation 
H satisfies CatTiff for each fluent constant F G C, 
H(F, T) = true, and for each fluent constant F' such 
that ->F' S C, H(F',T) = false. 

• Given a time-point T, a fluent constant F and an in- 
terpretation H , T is an initiation-point {termination- 
point resp.) for F in H relative to a domain descrip- 
tion D iff there is an action constant A such that (i) D 
contains both an h-proposition A happens-at T and 
a c-proposition A initiates (terminates, resp.) F 
when C, and (ii) H satisfies C at T . Then, an inter- 
pretation H is a model of a given domain description 
D iff, for every fluent constant F and time-points 
Tt < T 3 : 

1 . If there is no initiation- or termination-point T 2 for 
F in H relative to D such that Ti < T 2 -< T 3 . then 
H(F,Ti) = H(F,T 3 ). 

2. If Ti is an initiation-point for F in H relative to 
D, and there is no termination-point T2 for F in 
H relative to D such that Ti -< T 2 -< T 3 , then 
H(F,T 3 ) = true. 

3. If T\ is a termination-point for F in H relative 
to D, and there is no initiation-point T 2 for F in 
H relative to D such that Ti -< T 2 -< T 3 , then 
H(F,T S ) = false. 

4. For all t-propositions F holds-at T in 13, 
H(F,T) = true, and for all t-propositions "^F 
holds-at T'" in TJ>, H{F,T) = false. 

• A domain description D is consistent iff it has a 
model. Also, D entails (written f=) the t-proposition 
F holds-at T (->F holds-at T, resp.), iff for every 
model H of D, H(F, T) = irwe (JT(F, T) = /a^se, 
resp.). 

Note that the t-propositions, in effect, are like "static" 
constraints that interpretations must satisfy in order 
to be deemed models. We can extend the language £ 
with ramification statements, called r-propositions, of 



the form L whenever C, where L is a fluent literal and 
C is a set of fluent literals. These are also understood 
as constraints on the interpretations, but with the dif- 
ference of being "universal" , i.e. applying to every time 
point. Formally, the definition of a model is extended 
with: 

5. For all r-propositions L whenever C, in D, and for 
all time-points T, if H satisfies C at T then H 
satisfies {L} at T. 

In addition, the complete formalization of ramification 
statements requires a suitable extension of the defini- 
tions of initiation- and ter mination-point. The in ter- 
ested reader is refered to (Kakas & Miller 1997a) for 
the details. 

As an example, consider the following simple "car 
engine domain" D c , with action constants TurnOn and 
Empty and fluents Running and Petrol: 

TurnOn initiates Running when {Petrol} (D c l) 

Empty terminates Petrol (D c 2) 

TurnOn happens-at 5 (-D c 3) 

Petrol holds-at 1 (T) c 4) 

It is easy to see, for example, that D c entails Running 
holds-at 7 and that D c extended via the h-proposition 
Empty happens-at 3 does not. 

Planning with £ 

The language £ with its explicit reference to actions as 
h-propositions in its basic ontology is naturally suited 
for the problem of planning. Let a goal be a set of t- 
propositions. Then, given a domain description D and 
a goal G, planning amounts to constructing a set A of 
h-propositions such that D U A entails G. 

In general, however, the extension of D via the plan 
A might be required to respect some given precondi- 
tions for the actions in A. These preconditions can 
be represented by a new kind of statements, called p- 
propositions ("p" for "preconditions"), of the form A 
needs C, where A is an action constant and C is a 
non-empty set of fluent literals. Intuitively, the fluents 
in C are conditions that must hold at any time that the 
action A is performed. Note that, alternatively, pre- 
conditions could be encoded via additional conditions 
in c-propositions already appearing in the domain de- 
scriptions. The use of p-propositions is though simpler 
and more modular. 

Definition 1 An (£-)planning domain is a pair 
(D,P), where D is a domain description and P is a 
set of p-propositions. 

The semantic interpretation of the new type of sen- 
tences is that of integrity constraints on the domain 
descriptions. 

Definition 2 Given a planning domain (D, P) , D sat- 
isfies P, written D \= P , iff for all p-propositions A 
needs C in P, and for all h-propositions A happens- 
at T in D, D entails C(T), where C(T) denotes the set 



of t-propositions obtained by transforming every fluent 
literal in C into the respective t- proposition at T . 

The planning problem is then defined as follows. 

Definition 3 Given a planning domain (D, P) and a 
goal G, a (safe) plan for G in D is a set A of h- 
propositions such that D U A is consistent and : 

• D U A h G, 

• DU A |= P. 

Note that the initial state of the planning problem is 
assumed to be contained in the given domain descrip- 
tion, and might amount to a set of t-propositions at 
some initial time point, or, more generally a set of t- 
propositions over several time points, not necessarily all 
coinciding with a unique initial time point. 

The above definition of (safe) plan provides the for- 
mal foundation of the ^-planner. It is easy to see that, 
through the properties of the model-theoretic seman- 
tics of £ , a safe plan satisfies the requirements that (i) 
it achieves the given goal, and (ii) it is executable. 

As an example, let us consider the simple "car en- 
gine planning domain" (D' C ,P C ), with D' c consisting of 
statements (D c l), (D c 2) and (D c 4) from the previous 
section as well as: 



Fill initiates Petrol 
-^Running holds-at 1 

and P c consisitng of the p-proposition 

Fill needs -^Running 



(D c 5) 
(D c 6) 



Let the goal be G — Running holds-at Tf for some 
(final) time Tf. Then, a plan for G is given by the set 
Ai = {TurnOn happens-at Ti} where T\ -< Tf. This 
is a safe plan in the sense that if we add Ai to D' c , 
then both the goal G and the p-proposition in P c are 
entailed by the augmented domain. 

Consider now the domain D" obtained from D' c 
by removing (D C A). Note that then D" has in- 
complete (initial) information about Petrol. Then, 
the above plan Ai is no longer a safe plan for 
G as there is no guarantee that the car will have 
petrol at the time T\ when the TurnOn action is as- 
sumed to take place. A safe plan is now given by 
A2 = { TurnOn happens-at T\, Fill happens-at T2} 
with T 2 < Ti ~< T f . In the context of D", the origi- 
nal plan Ai will be called a weak plan. A weak plan is 



Ai by adding the additional action Fill happens-at T2, 
ensuring that A is entailed by D" U A2 . 

Definition 4 Given a planning domain (D, P) and a 
goal G, a weak plan for G in D is a set A of h- 
propositions s.t. there exists a model M o/DUA where: 

• M |= G, and 

• M \= P. 

A is conditional or depends on the set of assump- 
tions A iff A is a set of t-propositions such that A is 
not a safe plan for G in D but it is a safe plan for G 
in DUA. 

Note that a safe plan is always a weak plan and that if 
a weak plan is not conditional on any assumptions then 
it is necessarily a safe plan. 

Computing conditional weak plans will form the basis 
for computing safe plans in the ^-planner that we will 
develop in section . In general, if we have a weak plan 
for a given goal G, conditional on a set of assumptions 
A, then the original planning problem for G is reduced 
to the subsidiary problem of generating a plan for A. 
In effect, this process allows to actively fill in by further 
actions the incompleteness in the domain description. 

However, in some cases we may have incomplete in- 
formation on fluents that can not be affected by any 
further actions or at a time point (e.g. an initial time 
point) before which we cannot perform actions. In this 
paper we will concentrate on incompleteness of this 
kind, and we will study how to appropriately generate 
safe plans from weak plans despite the luck of informa- 
tion. 

Of course, this may not always be possible, but there 
are many interesting cases, such as the following "vac- 
cine" domain D v , where a safe plan exists: 

InjectA initiates Protected when {TypeO} (D v l) 
InjectB initiates Protected when {->TypeO} (D v 2) 

Here, the fluent TypeO cannot be affected by any of 
action (we cannot change the blood type) and although 
its truth value is not known (we have incomplete infor- 
mation on this) we can still generate a safe plan for the 
goal, G = Protected holds-at Tf, by performing both 
actions InjectA and InjectB before time Tf. 



An Argumentation Formulation of £ 

Argumentation has recently proved to be a unifying 
mechanism for most existin g non-monoto nic for malisms 
dBoridarenko et al. i997| |Dung 1995| ). In flKakas " 



a se of h-propositions such that the extension it forms Miller, fc Toni 1999f), we h ave adapted the LPwNF 



of the given domain description might not entail the 
given goal, but there is at least one model of the aug- 
mented domain description in which the goal holds true. 
A weak plan depends upon a set of assumptions, in the 
form of t-propositions, such that, if these assumptions 
were true (or could be made true) then the weak plan 
would be (or would become) a safe plan. In the example 
above Ai is weak as it depends on the set of assump- 
tions A = {Petrol holds-at Ti}. A2 is obtained from 



( |Dimopoulos fc Kakas 1995 ) argumentation framework 
to provide an equivalent reformulation of the original 
Language £ presented in section and to develop a 
proof theory for computing entailment of t-propositions 
in domain descriptions. This will form the computa- 
tional basis for our f-Planner. In this section, we give 
a brief review of the argumentation formulation for £ 
concentrating on the methods and results that would 
be needed for the ^-Planner. 



Let a monotonia logic be a pair {£., h) consisting of 
a formal language C (equipped with a negation opera- 
tor -■) and a monotonic derivability notion h between 
sentences of the formal language. Then, an abstract ar- 
gumentation program, relative to (C, h)), is a quadruple 
(B,A,A',<) consisting of 

• a background theory B, i.e. a (possibly empty) set of 
sentences in C, 

• an argumentation theory A, i.e. a set of sentences in 
C (the argument rules), 

• an argument base A' C A, and 

• a priority relation, < on the ground instances of the 
argument rules, where <f> < tp means that (f> has lower 
priority than ip. 

Intuitively, any subset of the argument base can serve 
as a non-monotonic extension of the (monotonic) back- 
ground theory, if this extension satisfies some require- 
ments. The sentences in the background theory can be 
seen as non-defeasible argument rules which must be- 
long to any extension. One possible requirement that 
extensions of the background theory must satisfy is that 
they are admissible, namely that they are: 

• non- self- attacking and 

• able to counterattack any (set of) argument rules at- 
tacking it. 

Informally, a set of argument rules from A attacks an- 
other such set if the two sets are in conflict, by deriving 
in the underlying logic complimentary literals A and -iA, 
respectively, and the subset of the attacking set (mini- 
mally) responsible for the derivation of A is not overall 
lower in priority than the subset of the attacked set 
(minimally) responsible for the derivation of -iA. A set 
of rules A is of lower priority than another set B if it 
has a rule of lower priority than some rule in B and 
does not contain any rule of higher priority than some 
rule in B. 

Then any given sentence a of C is a credulous (scep- 
tical, resp.) non-monotonic consequence of an argu- 
mentation program iff B U A h a for some (all, resp.) 
maximally admissible extension(s) A of the program. 

A domain description D without t-propositions 
can be translated into an argumentation program 
(B(D),As,As',<s), referred to as Pe(D), such that 
there is a one-to-one correspondance between: 

i) models of D and maximally admissible sets of ar- 
guments of Ps(D); 

ii) entailment in £ and sceptical non-monotonic con- 
sequences of Ps(D). 

These equivalence results continue to hold when D con- 
tains t-propositions or r-propositions by simply consid- 
ering only the admissible sets that confirm the truth of 
all such propositions in D. 

The basic elements of the translation of domain de- 
scriptions D into argumentation programs P$(D) are 
as follows. All individual h- and c-proposition transla- 
tions as well as the relationships between time-points 



are included in the background theory B(D), so that 
for all time-points T, T" and action constants A, 

• B(D) b T <T' iff T <T 

• B(D) h HappensAt(A,T) iff A happens-at T is in 

D, 

• for each c-proposition 

A initiates F when {L\, . . . , L n } in D 
(resp. A terminates F when {L\, . . . , L n }), 
B(D) contains the rule 

Initiation(F,t) <— HappensAt(A,t),A(Li),. . . ,A(L n ) 
(Termination(F,t) <— Happens At(A,t),. .. ,A(L n ) 
resp.), where A(Li) = (-) Holds At(Fi,t) if = 
(~^)Fi, for some fluent constant Fi. 

As an example, consider the domain description 
D c in section . Then, B(D C ) contains the fact 
Happens At(TurnOn, 5) and B(D C ) contains the rules 

Initiation(Running, t) <— 

Happens At(TurnOn, t)floldsAt(Petrol, t) 

Termination(Petrol,t) HappensAt(Empty,t). 
The remaining components of Pg(D) are independent 
of the chosen domain D: 

• Ag consists of 
Generation rules: 

Holds At(f, t 2 )<-Initiation(f, h), t x <t 2 (PG[f, t 2 ;t x ]) 
^HoldsAt(f, t 2 )^-Termination(f , t x ), h-4 2 (NG[f ,t 2 ;t x ]) 

Persistence rules: 

HoldsAt(f, t 2 )<- Holds At(f, t x ),t x -<t 2 (PP[f, t 2 ;t x ]) 
->HoldsAt(f, t 2 )<—>HoldsAt(f, t x ), t x -<t 2 (NP[f, t 2 ;h]) 

Assumptions: HoldsAt(f,t) (PA[f,t)) 
-nHoldsAt(f,t) (NA[f,t\) 

• As' consists of all the generation rules and assump- 
tions only. 

• <£ is such that the effects of later events take pri- 
ority over the effects of earlier ones. Thus per- 
sistence rules have lower priority than "conflicting" 
and "later" generation rules, and "earlier" gener- 
ation rules have lower priority than "conflicting" 
and "later" generation rules. In addition, assump- 
tions have lower priority than "conflicting" genera- 
tion rules. For example, given the vocabulary of D c 
in section , PA[Running, 5] <£ NG[Running,b;3] 
and NG[Running, 7; 3] <£ PG[Running , 7; 5]. 

Given this translation of the language £ a proof 
theory can be developed by adapting the ab- 
str act, argumentation- based computational framework 
in ( Kakas fc Toni 1999 ) to the argumentation programs 
Ps (D) . The resulting proof theory is defined in terms of 
derivations of trees, whose nodes are sets of arguments 
in As attacking the arguments in their parent nodes. 
Let So be a (non-self-attacking) set of arguments in As 
such that B(D) U S a h (-)HoldsAt(F,T), for some lit- 
eral (-■) F holds-at T that we want to prove to be 
entailed by D (So can be easily built by backward rea- 
soning). Then, two kinds of derivations are defined: 



- Successful derivations, building, from a tree consist- 
ing only of the root Sq, a tree whose root S is an 
admissible subset of As' such that S 2 Sq. 

- Finately jailed derivations, guaranteeing the absence 
of any admissible set of arguments containing Sq. 

Then, the given literal is entailed by D if there exists a 
successful derivation with inital tree consisting only of 
the root Sq and, for every set S of argument rules in 
As' such that B(D) U S' derives (in h the complement 
of the given literal, every derivation for S is finitely 
failed. 

This method is extended in the obvious way to handle 
conjunctions of literals rather than individual literals by 
choosing Sq and S' appropriately. Also when a domain 
D contains t-propositions we simply conjoin these to 
the literals Sq and S . A similar extension of requiring 
that all the r-propositions are satisfied together with 
the goal at hand is applied for the domains containing 
such ramification statements. 

The details of the derivations are not needed for the 
purposes of this paper. Informally, both kinds of deriva- 
tion incrementally consider all attacks (sets of argu- 
ments in As) against Sq and, whenever the root does 
not itself counterattack one of its attacks, a new a new 
set of arguments in As' that can attack back this at- 
tack is generated and added to the root. Then, the 
process is repeated, until every attack has been coun- 
terattacked successfully (successful derivation) by the 
extended root or until some attack cannot be possibly 
counterattacked by any extension of the root (finitely 
failed derivations) During this process, the counterat- 
tacks are chosen in such a way that they do not attack 
the root. For example, for the domain D c in section , 
given Sq = {PG[Running, 7; 5], PA[Petrol, 5]}, mono- 
tonically deriving Holds At{Running,l), a successful 
derivation is constructed as follows: 

So Sq So 



So is attacked by K = {NA[Petrol, 5]}, trivially coun- 
terattacked by Sq itself. Thus, in this simple example, 
no extension of the root is required. 

The ^-Planner 

The argumentation-based techniques discussed in the 
previous section can be directly extended to compute 
plans for goals. (In the sequel, we will sometimes mix 
the original Language E formulation of problems and 
their corresponding formulation in the argumentation 
reformulation.) First, given a goal G, in order to derive 
the (translation (A(G) of the) goal in the underlying 
monotonic logic, a preliminary step needs to compute 
not only a set of argument rules Sq, but (possibly) also 
a set action facts Ao C H where 

H = {Happens At(F,T)\ F is a fluent constant and 
T is a time-point} 

Aq can be seen as a preliminary plan for the goal, that 



needs to be extended first to a weak plan and then to 
a safe plan. Every time a new action fact is added to a 
plan, any preconditions of the action need to be checked 
and, possibly, enforced, by adding further action facts. 
The computation of safe plans from weak ones requires 
blocking, if needed, any (weak) plan for the complement 
of any literal in the goal. 

The following is a high-level definition of the £- 
Planner in terms of the argumentation-based re- 
formulation of the ^-language: 

Definition 5 Given a planning domain (D, P) and a 
goal G, an £ -plan for G is a set h-propositions A £ such 
that A £ = {A happens-at T\HappensAt(A,T) e A}, 
where A C Ti is derived as follows: 

1 ) Find a set of arguments Sq C As' and a set of action 
facts A C H such that B(D) U A U Sq H G; 

2) Construct a set of arguments S C As and a set 
of action facts A w C Ft such that (i) Sq C S and 
Ao C A w , and (ii) S is an admissible set of ar- 
guments wrt the augmented argumentation program 
P £ (D' UA W ) = (B(D , )uA w ,As,As',<£), where 
D' = D U {A(C(T))\A needs C e 
P, Happens At(A,T) G A w }. 

3) If every assumption in S is a sceptical non-monotonic 
consequence of the augmented argumentation pro- 
gram Pe(D' U A w ) then A = A w . 

4 ) Otherwise, A is a set of action facts such that A w C 
A and: 

4-1) For every set of arguments R C As' such that 

B(D) U A U R I <G, where ->G stands for the 

complement of some literal in G, there exists no 
R' C As such that R C R' and R' is admis- 
sible wrt the augmented argumentation program 
P £ {D" U A) = (B(D") U A, As, As', < £ ), where 
D" = D U {A(C(T))\A needs C G 
P, Happens At(A,T) G A}. 

4-2) There exists a set S' C As such that S C S' and 
S' is admissible wrt Ps(D" U A). 

In the first two steps, the ^-Planner computes a weak 
plan for the given goal. If this does not depend on any 
assumptions (step 3) then it is a safe plan, as no plan 
for the complement of the goal is possible. Otherwise 
(step 4) , the planner attempts to extend the weak plan 
in order to block the derivation of the complement, ->G, 
of the goal. In order to do so, it considers each possi- 
ble set of arguments, R, which would derive ->G (in the 
augmented background theory) and extends the plan so 
that R can not belong to any admissible set of the re- 
sulting theory. A successful completion of step 4 means 
that the weak plan A„, has been rendered into a safe 
plan A. 

The correctness of the planner is a direct consequence 
of the correctness of the argumentation proof theory on 
which it is based. 

Theorem 1 Given a planning domain (D, P) and a 
goal G, let A w be the set of action facts computed at 
step 2 in definition 0. Then, the set 



A^={yL happens-at T\Happens(A,T) 6 A^,} 
is a weak plan for G. 

Proof: As every admissible set of arg uments is con- 
tain ed in som e maximally admissible set (Kakas, Miller 
& Tsni 1999) and, as discussed in section , every maxi- 
mally admissible extension of the argumentation-based 
reformulation of a domain description in £ corresponds 
to a model of the original domain, S computed at step 
2 in definition || corresponds to a model M of D' U 
entailing G. Because of the way t-propositions in do- 
mains are handled, as additional conjuncts in goals, this 
implies that M satisfies all preconditions of actions in 
Af,. Thus, M is a model of D U Af, such that M \= G 
and M \= P and the theorem is proven. 

The following theorem can be proven in a similar way: 

Theorem 2 Given a planning domain (D, P) and a 
goal G, let A £ be an Z-plan for G. Then, A £ is a safe 
plan for G. 

The high-level definition of the ^-Planner given above 
in definition [s] can be mapped onto a more concrete 
planner by suitably exten ding the argumentation-base d 
proof theory proposed in (Kakas, Miller, & Toni 1999). 
Ao can be computed directly while computing So, by an 
abductive process which reasons backwards with the 
sentences in the background theory. Also, one needs to 
define suitable: 

- extended successful derivations, for computing 
incrementally A w from Ao at step 2 and the final A 
at step 4.2 from the extension of A w computed at 
step 4.1; 

- extended finitely failed derivations, for comput- 
ing incrementally the required extension of A^, at 
step 4.1. 

As the original derivations, the extended ones incremen- 
tally consider all attacks against the root of the trees 
they build and augment the root so that all such at- 
tacks are counterattacked, until every attack has been 
counterattacked (successful derivations) or some attack 
cannot be counterattacked (failed derivations). Again, 
all nodes of trees are sets of arguments. 

In addition, both new kinds of derivation are inte- 
grated with abduction to generate action facts so that 
success and failure are guaranteed, respectively. The ac- 
tion facts are chosen to allow for counterattacks to exist 
(successful derivations) or for additional attacks to be 
generated (failed derivations). Thus, extended success- 
ful derivations return both a set of argument rules in 
As' and a set of action facts, wherever extended failed 
derivations just return a set of action facts (the ones 
needed to guarantee failure). 

Both kinds of derivations need to add to the given 
background theory (domain) the t-propositions that are 
preconditions of any abduced action. By the way t- 
propositions are handled, this amounts to extending 
dyn amically the given goal to prove or disprove, respec- 
tive y 



Finally, both kinds of derivation require the use of 
suspended nodes, namely that could potentially at- 
tack or counterattack their parent node if some action 
facts were part of the domain. These nodes become 
actual attacks and counterattacks if and when the ac- 
tion facts are added to the accumulated set. If, at the 
end of the derivations, these action facts are not added, 
then suspended nodes remain so and do not affect the 
outcome of the derivations. 

Let us illustrate the intended behaviour of the ex- 
tended derivations with a simple example. Consider 
the simple "car engine" domain D' c in section , and 
let G = HoldsAt(Running,Tf) for some fixed fi- 
nal time Tf. We will show how the safe A w = 
{Happens At{TurnOn,T\){, with T\ -< Tf, is com- 
puted. 

1) S = {PG[Running,T f ;Ti],PA[Petrol,Ti]} and 
A w = Happens At{TurnOn,T\) , with T\ -< Tf. 

2) The only possible attack against So is 
{NA[Petrol,Ti]}, which is trivially counterattacked 
by Sq itself. Thus So is admissible. 

3) Let us examine the only assumption PA[Petrol, T{\ 
in Sq, and try to prove that it holds in all admissi- 
ble extensions of the given domain extended by A„, . 
Consider the complement NA[Petrol, T±] of the as- 
sumption, and let us prove that it holds in no admis- 
sible extension. This can be achieved by an ordinary 
finitely failed derivation (without abducing any addi- 
tional action fact in order to do so) , as there is an at- 
tack, {PP[Petrol,Tx;i\, PA[Petrol,l]}, against the 
above complement, which cannot be counterattacked. 

Thus, A w is a safe plan for G. Consider now the domain 
D" = D' c — {(D c 4)}. Then, step 3 above fails to prove 
that the given assumption is a sceptical non-monotonic 
consequence of the augmented domain, and thus A w is 
just a weak plan. 

4.1) —iG = ->HoldsAt(Running,Tf) is derivable via 
R — {NA[Running,Tf]}. This is attacked 
by {PG[Running,T f ;Ti], PA[Petrol,T!]}, which 
is counterattacked by {NA[Petrol, Tx]}, which, if 
added to R, would provide an admissible extension in 
which -iG holds. An extended finitely failed deriva- 
tion can be constructed to prevent this as follows: 
the extended root R U {NA[Petrol, Ti]} is attacked 
by {PG[Petrol,T 1 ;T2\} if A w is augmented to give 
A = A w U {Happens At{Fill,T 2 )}, with T 2 -< T x . 
As the new attack cannot be counterattacked, the 
derivation fails. 

Thus, A is a safe plan for G (for simplicity we omit step 
4.2 here). 

Note that the method outlined above relies upon 
the explicit treatment of non-gro und arithmetical con- 
straints over time-points (see (Kakas, Michael, & 



Mourlas 199S)) 



Incomplete planning problems 

In this section we will illustrate through a series of ex- 
amples the ability of the £ — Planner to produce safe 
plans for incompletely specified problems. In particu- 
lar, we will consider problems where the incompleteness 
on some of the fluents is such that it cannot be affected 
by actions and hence our knowledge of them cannot be 
(suitably) completed by adding action facts to plans. 

Let us consider again the "vaccine" domain D v at the 
end of section , and the goal G = {Protected holds-at 
Tf}, for some final time Tf. We will show how, in this 
example, the ^-planner reasons correctly with the "ex- 
cluded middle rule" to produce a safe plan for G, de- 
spite the fact that it is not known whether the blood if of 
TypeO or not. A weak plan for the goal is given by A w — 
{HappensAt(InjectA,Ti)} with Ti -< Tf. Indeed: 
given S = {PG[Protected, T f ; Ti], PA[TypeO, Ti]}, 
B{D V ) U So U A w h G, So is admissible (steps 1 and 
2). The plan A„ is conditional on the set of as- 
sumptions {PA[TypeO,Ti\} (step 3). Note that there 
is no action that can affect the fluent TypeO, so A w 
cannot be extended so that it can derive the as- 
sumption. Nevertheless, we can extend A^, to a safe 
plan A by constructing failed derivations for -^G = 
{^HoldsAt(Protected,Tf)}, as illustrated below (step 
4.1). 

The only way to derive —>G is by means of the set of 
arguments R\ = {N A[Protected, Tf]}. This is attacked 
by So itself, which can be counterattacked (only) if the 
root i?i is extended via the assumption NA[TypeO, Ti]. 

The initial root R\ and thus the extended 
root i?2 are attacked by the set of arguments 
{PG[Protected,T f ;T 2 ],NA[TypeO,T 2 }} with T 2 -< 
Tf, provided we add to A w the action fact 
Happens At(InjectB,T 2 ) to give a new plan A = 
{HappensAt(InjectA, Ti), Happens At (Inject B, T 2 )}. 
In order to successfully counterattack the new attack 
we need to add further to the root the argument 
PA[TypeO, T 2 ], obtaining a new root R 3 . 

R 3 is (newly) attacked by {PP[TypeO, Ti; T 2 ], 
PA[TypeO,T 2 }}, if T 2 -< Ti, and by {NP[TypeO, T 2 ; Ti], 
NA[TypeO, Ti]}, if Ti -< T 2 . (Note also that neces- 
sarily Ti 7^ T 2 as otherwise -R3 would attack itself.) 
These attacks can only be counterattacked via one 
generation rule for ->HoldsAt(TypeO,Ti) and one for 
HoldsAt(TypeO,T 2 ), respectively. But no such genera- 
tion rules are possible. 

This concludes the construction of the only required 
finitely failed extended derivation for -iG. (Again, we 
omit step 4.2.) The computed £-plan A is indeed a safe 
plan for G. Note that no p-propositions are present in 
this example and thus no extended domain is generated. 

The following example illustrate the use of observa- 
tions (in the form of t-propositions) to provide some 
(partial) implicit information on the domain. Consider 
the domain Df. 

InjectC initiates Protected when {TypeA} (-Djl) 
InjectD initiates Protected when {Weak} (T>i2) 



Bite initiates Infected when {TypeA} (T>i3) 

Expose initiates Infected when {jyeafc} (T*i4) 

-^Infected holds-at 1 (-0,5) 

Infected holds-at 4 {Dfi) 

and the goal G — Protected holds-at Tf, for some final 
time 4 -<Tf. The fluents TypeA and Weak are incom- 
pletely specified, but the observations (t-propositions) 
essentially give indirectly the information that cither 
TypeA or Weak must hold. This then allows, similarly 
to the previous example, to generate a safe plan for G 
by applying both actions InjectC and InjectD. 

A weak plan A„ = {Happens At(InjectC,T\)} 
with Tl -< Tf is first generated (steps 1 and 2), 
conditional on the assumption set {PA[TypeA, Ti]} 
(step 3). Then, step 4.1 generates R = 
{N A[Protected,Tf]}, proving ->G. R is attacked 
by {PGlProtecte^Tf-T^PAlTypeA,^}} and can 
only be defended if R is extended to R' = 
R U {N A[TypeA,Ti]} , which is admissible. How- 
ever, R' needs to be extended to confirm the t- 
propositions. To confirm (T)j5), the assumption 
NA[Infected, 1] needs to be added to R' . More- 
over, to confirm (Di6), we need to add cither 
i?i = {PG[Infected,T f ;T 2 ],PA[TypeA,T 2 }} or R 2 = 
{PG[Infected,T f ;T 2 },PA[Weak,T 2 }}, with T 2 -< 4. 
But adding the fist such set would render the result- 
ing set non- admissible (as both PA[TypeA,T 2 ] and 
NA [TypeA, Ti] belong to it). Hence, the only vi- 
able extension is R" = R' U R 2 U {NA[Infected, 1]}. 
This set is admissible (if we also abduce the action 
HappensAt(Expose,T 2 )). To prevent that, we find 
an attack that cannot be counterattacked successfully: 
{PG[Protected,T f ;T 3 },PA[Weak,T 3 }}, T 3 -< T>, ex- 
tending A w to A = A w U {Happens At(InjectD,T 3 )}. 
This attack can only be counterattacked by adding 
the assumption NA[Weak, T 3 ], rendering the root self- 
attacking (as PA[Weak,T 2 ] belongs to it). Thus, A is 
a safe plan. 

The next example shows how the £ -planner exploits 
ramification information to generate safe plans for in- 
completely specified problems. Let D r be: 

InjectB initiates Protected when {-*TypeO} (D r \) 
InjectE initiates Protected when {Strong} (D r 2) 
Strong ■whenever {TypeO} (-D r 3) 

and G = Protected holds-at Tf, for some final 
time A ^ Tf. The fluents TypeO and Strong are 
incompletely specified. The ramification statement 
requires that either -1 TypeO or Strong must hold 
(at any time). Then, similarly to the above ex- 
amples, the £ — planner can generate the safe plan 
{Happens At(InjectB , Tl), Happens At(InjectE , T 2 )}, 
with T\,T 2 -<, Tf. During the computation of this plan, 
to render the only proof of ->G non-admissible we gen- 
erate, in addition to the attack given by the weak plan 
{Happens At(InjectB , Tl)} , an extra attack by adding 
the action Happens At(InjectE,T 2 ). The first attack 



can only be counterattacked by {PA[TypeO, TJ} and 
the second only by {NA[Strong, T 2 ]}. As these as- 
sumptions persist, it is not possible to satisfy the ramifi- 
cation statement at any time between T± and T%. Hence 
there is no admissible extensions that can prove ->G and 
the plan is safe. 

Conclusions 



We have shown how we can formulate planning within 
the framework of the Language £ and have used the ar- 
gumentation reformulation of this framework to define 
a planner that is able to solve problems with incomplete 
information. 

A planner with similar a ims has been defined in 
(Finzi, Pirri, & Reiter 1999). Both this planner and 
our ^.'-planner regress to a set of assumptions which, 
when entailed by the incom plete theory, guarantees the 
plan to be safe. However, ( Finzi, Pirri, fe Reiter 1999 ) 
uses a classical theorem prover to check explicitly this 
entailment at the initial situation (in general, the re- 
quired entailment is the non-monotonic entailment of 
the action framework in which the planning problems 
are formulated). Instead, in the £ -planner the incom- 
pleteness, and hence the assumptions to which one re- 
gresses, need not refer to the initial state only. More- 
over, the ^-planner uses these assumptions in the com- 
putation of the initial possibly weak plan and then to 
help in the extension of this to a safe plan. We are 
studying other planning algorithms (within the same 
argumentation formulation of the language E) which 
use more actively the assumptions on which weak plans 
are conditional. One such possibility is to try to extend 
the plan so as to re-prove the goal but now assuming 
a-priori the contrary of these assumptions. The search 
space of this type of planning algorithm is different and 
comparative studies of effeciency can be made. 



(Smith & Weld 1998) introduce the notion of confor- 
mant planning for problems with incomplete informa- 
tion about the initial state and for problems where the 
outcome of actions may be uncertain. Our safe plans 
correspond to conformant plans for problems of the first 
type. The emphasis of this work is on the development 
of an efficient extension of Graphplan to compute con- 
formant plans, by first considering all possible worlds 
and, in each world, all possible plans, and then extract- 
ing a conformant plan by considering the interactions 
bet ween these differ ent plans and worlds. 

( Giunchiglia 200C ) considers the problem of planning 
with incomplete information on the initial state within 
the action language C. This is an expessive language 
that allows concurrent and non-deterministic actions 
together with ramification and qualification statements. 
Our safe plans correspond to the notion of valid plans 
which in turn are conformant plans. To find a valid 
plan, the C-Planner generates a possible plan and then 
tests, using a SAT solver, whether the generated plan 
can be executed in all the possible models. Possible 
plans can be seen as weak plans, but we allow in the 



"testing phase" for the dynamic expansion of the weak 
plan into a safe plan. 

A general diffe rence with both ( Smith fc Weld 19"9§| ; 
Giunchiglia 2000) is that the ^-planner is goal-oriented, 
with an active search for actions both for the satisfac- 
tion of the goal and for ensuring that the generated plan 
is executable in any of the many possible worlds for the 
problem. 

Another difference is at the level of expressivenes, in 
that we allow observations (not only at an initial state) 
and can exploit indirect information given by them to 
help handle the incompleteness. 

We are currently developing an implementation of 
the ^-planner based on an earlier implmcntation of the 
language £ and aim to carry out experiments with stan- 
dard planning domains. In this initial phase of our 
study we have not considered efficiency issues, concen- 
trating specifically on the problem of planning under 
incompletness. In future work we need to address these 
issues by studying the problem of effective search in 
the space of solutions. One way to do this is to con- 
sider the integration of constraint solving in the plan- 
ner as in constraint logic programming and its exten- 
sion with abduction (Kakas, Michael, fc Mourlas 1998] ; 



Kowalski, Toni, fc Wetzel 1998] ). 

We are considering several extensions of the £- 
planner to allow for more general plans e.g. contain- 
ing non-deterministic actions (or actions with uncertain 
effects). These extensions would require correspond- 
ing extensions of the expressiveness of the Language £ . 
Also, the extension of the ^-planner to accommodate 
sensing, in the form of accepting further observations 
(t-propositions) in the problem description, is a natu- 
ral problem for future work. 
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